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(54) Title: CONTROLLED AUGMENT OF NANO-BARCODES ENCODING SPECTHC INFORMAOON FOR SCANNING 
PROBE MICROSCOPY (SPM) READING 

(57) Abstract: The methods, apparatus and compositions disclosed herein concern the detection, identification and/or sequencing of 
biomolecules, such as nucleic acids or proteins. In certain embodiments of the invention, coded probes comprising a probe molecule 
attached to one or more nano-barcodes may be allowed to bind to one or more target molecules. After binding and separation from 
unbound coded probes, the bound coded probes may be aligned on a surface and analyzed by scanning probe microscopy. The 
nano-barcodes may beany molecule or complex that is distinguishable by SPM, such as carbon nanotubes, fullerenes, submicrometer 
metallic barcodes, nanoparticles or quantum dots. Where the probes are oligonucleotides, adjacent coded probes hybridized to a 
target nucleic acid may be ligated together before alignment and SPM analysis. Compositions comprising coded probes are also 
disclosed herein. Systems for biomolecule analysis may comprise an SPM instrument and at least one coded probe attached to a 
surface. 
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CONTROLLED ALIGNMENT OF NANO-BARCODES ENCODING SPECIFIC 
INFORMATION FOR SCANNING PROBE MICROSCOP Y fSPM> READING 

FIELD OF THE INVENTION 

5 [0001] Hie present methods, compositions and apparatus relate to the fields of molecular 
biology and analysis of biomolecules including, but not limited to, nucleic acids, proteins, 
lipids and polysaccharides. In particular, the invention relates to methods, compositions 
and apparatus for detection, identification and/or sequencing of nucleic acids and/or other 
biomolecules using nano-barcodes and scanning probe microscopy (SPM). 

10 BACKGROUND 

[0002] Identification and/or sequencing of biomolecules, such as nucleic acids or proteins, 
is essential for medical diagnostics, forensics, toxicology, pathology, biological warfare, 
public health and numerous other fields. Although a great deal of research is presently 
directed towards identification and/or sequencing of nucleic acids or proteins, other 
1 5 biomolecules s uch a s c arbohydrates, p olysaccharides, 1 ipids, fatty a cids, e /c. m ay b e o f 
importance. The methods, compositions and apparatus disclosed herein are not limited to 
identification and/or sequencing of nucleic acids, but are also of use for analysis of other 
types of biomolecules, including but not limited to proteins, hpids and polysaccharides. 

[0003] Standard methods for nucleic acid detection, such as Southern blottmg or binding 
20 to nucleic acid chips, rely on hybridization of a fluorescent or radioactive probe molecule 
with a target nucleic acid molecule. Known methods for nucleic acid sequencing typically 
utilize either the Sanger dideoxy technique or hybridization to nucleic acid chips. 

[0004] Oligonucleotide hybridization based assays are in wide use for detection of target 
nucleic acids. A probe oligonucleotide that is complementary in sequence to a target 

25 nucleic acid is attached to a fluorescent, radioactive or other moiety and allowed to 
hybridize to a nucleic acid through Watson-Crick base pair formation. Many variations on 
this technique are known. More recently, DNA chips have been designed that can contain 
hundreds or even thousands of oligonucleotide probes. Hybridization of a target nucleic 
acid to an oligonucleotide on a chip may be detected using fluorescence spectroscopy, 

30 radioactivity, etc. Problems with sensitivity and/or specificity may result firom nucleic 

1 
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acid hybridization between sequences that are not precisely complementary. The presence 
of low levels of a target nucleic acid in a sample may not be detected. 

[0005] Methods for Sanger dideoxy nucleic acid sequencing, based on detection of four- 
color fluorescent or radioactive nucleic acids that have been separated by size, are limited 
by the length of the nucleic acid that can be sequenced. Typically, only 500 to 1,000 bases 
of nucleic acid sequence can be determined at one time. Using current methods, 
determination of a complete gene sequence requires that many copies of the gene be 
produced, cut into overlapping fragments and sequenced, after which the overlapping 
DNA sequences may be assembled. This process is laborious, expensive, inefficient and 
time-consxraiing. It also typically requires the use of fluorescent or radioactive moieties, 
which can potentially pose safety and waste disposal problems. More recent metiiods for 
nucleic acid sequencing using hybridization to ohgonucleotide chips may be used to infer 
short nucleic acid sequences or to detect the presence of a specific nucleic acid in a 
sample, but are not suited for identifying long nucleic acid sequenceis. 

[00061 A variety of techniques are available for identification of proteins, polypeptides 
and peptides, Conunonly, these involve binding and detection of antibodies that can 
recognize one or more epitopic domains on the protein. Although antibody-based 
identification of proteins is fairly rapid, such assays may occasionally show unacceptably 
high levels of false positive or false negative results, due to cross-reactivity of the antibody 
with different antigens, low antigenicity of the target analyte (leading to low sensitivity of 
the assay), non-specific binding of antibody to various surfaces, etc. They also reqmre the 
preparation of antibodies that can recognize an individual protein or peptide. As such, 
they are not suitable for the identification of novel proteins that have not previously been 
characterized. 

[0007] A need exists for rapid, accurate and sensitive methods for detection, identification 
and/or sequencing of biomolecules, such as nucleic acids or proteins. 

BRIEF DESCRIPTION OF THR DRAWINGS 

[0008] The following drawings form part of the present specification and are included to 
fiulher demonstrate certain aspects of the disclosed embodiments of the invention. The 
embodiments of the invention may be better understood by reference to one or more of 
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these drawings in combination with the detailed description of specific embodiments 
presented herein. 

[00091 FIG. 1 illustrates an exemplary method for aligning coded probes 130, each 
comprising one or more nano-barcodes attached to a probe molecule on a surface 100. (A) 
5 Immersion of a surface 100 into a solution 110 containing coded probes 130. (B) 
Removal of the surface 100 containing aUgned coded probes 130 firom solution 110. 

[0010] FIG. 2 illustrates an alternative exemplary method for aligning coded probes 230 
on a surface 220. (A) A drop of solution 210 containing coded probes 230 is sandwiched 
between a cover slip 200 and a glass sUde 220. While the cover slip 200 is held in place, 
10 the slide 220 is moved, resulting in alignment of the coded probes 230. 

[0011] FIG. 3 illustrates another alternative exemplary method for aUgning coded probes 
340 on a surface 300. 

[0012] FIG. 4 illustrates an exemplary coded probe 400, comprising a nano-barcode 420 
attached to a probe molecule 410. An individual nano-barcode 420 may be comprised of 
1 5 one or more moieties, as discussed in more detail below. 

[0013] FIG. 5 shows an exemplary scheme for synthesis of coded probes. (A) Conversion 
of exemplary nano-tag element into a bi-functional molecule containing Rl and R2 
functional moieties. (B) Protection of one functional moiety and activation of the other. 
(Q Stepwise addition of building blocks in a controlled polymerization. 

20 [0014] FIG. 6 illustrates a general scheme for backbone mediated nano-barcode synthesis. 
(A) Monofunctionalization of tag unit (B) Conversion to amino acid analog. (C) 
Conversion to nucleotide analog. 

[0015] FIG. 7 shows an exemplary modification of a bi-functional fuUerene diol for 
incorporation into a coded probe. 

25 [0016] FIG. 8 shows an exemplary structure of a bi-functional fuUerene of use in coded 
probe synthesis. 

[0017] FIG. 9 shows an exemplary image of digested lambda DNA obtained by atomic 
force microscopy. 

3 
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[0018] FIG. 10 shows an example of DNA molecules aligned by microfluidic molecular 
combing (MMC). 

[0019] FIG. 11 shows another example of DNA molecules aligned by microfluidic 
molecular combing (MMC). 

[0020] FIG. 12 illustrates an exemplary oligonucleotide based nano-barcode made up of 
13 individual oligonucleotide strands hybridized together. 

[0021] FIG. 13 shows the individual oHgonucleotide components of the nano-barcode of 
FIG. 12. Note that, as shown in FIG. 12, there are 9 fragments (labeled PTl to PT9, in 
order) used to make the top strand of the nano-barcode and 4 fragments (labeled #1 to #4) 
used to make the bottom strand. The hybridized nano-barcode exhibits branch points 
detectable by scanning probe microscopy. 

[0022] FIG. 14 lists the complete sequences of PTl through PT9, including the branch 
points. 

[0023] FIG. 15 shows the nano-barcode of FIG. 12 and FIG. 13, imaged by atomic force 
microscopy (arrow, top right of Figure). For comparison, a 2.8 kb linearized plasmid 
DNA is also shown. 

DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

[0024] The disclosed methods, compositions and apparatus are of use for detection, 
identification and/or sequencing of biomolecules, such as nucleic acids. In particular 
20 embodiments of the invention, the methods, compositions and apparatus are suitable for 
obtaining the sequences of very long nucleic acid molecules of greater than 1,000, greater 
than 2,000, greater than 5,000, greater than 10,000 greater than 20,000, greater than 
50,000, greater than 100,000 or even more bases in length. Advantages include the ability 
to read long nucleic acid sequences in a single sequencing run, high speed of obtaining 
25 sequence data, low cost of sequencing and high efficiency in terms of the amoxmt of 
operator time required per imit of sequence data. Other advantages include the sensitive 
and accurate detection and/or identification of nucleic acids, with low incidence of false 
positive results. 

[0025] The following detailed description contains numerous specific details in order to 
30 provide a m ore t borough u nderstanding o f t he d isclosed embodiments o f the invention. 

4 
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However, it will be apparent to those skilled in the art that the embodiments of the 
invention may be practiced without these specific details. In other instances, devices, 
methods, procedures, and individual components that are well known in the art have not 
been described in detail herein. 

S Definitions 

[0026] As used herein, "a'* or "an" may mean one or more than one of an item. 

[0027] As used herein, "about" means within ten percent of a value. For example, "about 
100" would mean a value between 90 and 110. 

[0028] "Nucleic acid" encompasses DNA, RNA (ribonucleic acid), single-stranded, 
10 double-stranded or triple stranded and any chemical modifications tiiereof. Virtually any 
modification of the nucleic acid is contemplated. A "nucleic acid" may be of almost any 
length, from oligonucleotides of 2 or more bases up to a fiiU-length chromosomal DNA 
molecule. Nucleic acids include, but are not limited to, oligonucleotides and 
polynucleotides. 

15 [0029] "Coded probe" refers to a probe molecule attached to one or more nano-barcodes. 
A probe molecule is any molecule that exhibits selective and/or specific binding to one or 
more target molecules. In various embodiments of the invention, each different probe 
molecule may be attached to a distinguishable nano-barcode, so that binding of a 
particular p robe from a population o f different probe m olecules m ay b e d etected. T he 

20 embodiments of the invention are not limited as to the type of probe molecules that may 
be used. Any probe molecule known in the art, including but not limited to 
oligonucleotides, nucleic acids, antibodies, antibody fragments, binding proteins, receptor 
proteins, peptides, lectins, substrates, inhibitors, activators, ligands, hormones, cytokines, 
etc. may be used, hi certain embodiments of the invention, coded probes may comprise 

25 oligonucleotides and/or nucleic acids that have been covalently or non-covalently attached 
to one or more nano-barcodes that identify the sequence of the oUgonucleotide and/or 
nucleic acid. In various embodiments of the invention, a linear series of coded probes may 
be ligated together. Each coded probe in the ligated molecule may be attached to a 
distinguishable nano-barcode to allow identification of its sequence. Since flie sequence 

30 of coded probes in a ligated molecule may also be determined, the sequence of the entire 
ligated molecule may be identified- In alternative embodiments, each nucleotide within an 

5 
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oligonucleotide probe may be attached to a distinguishable nano-barcode, allowing the 
sequence of the coded probe to be identified from the sequence of nucleotides. 

[0030] "Nano-barcode" refers to a composition that may be used to detect and/or identify 
a coded probe. In non-limiting examples discussed in more detail below, a nano-barcode 
5 may comprise one or more submicrometer metallic barcodes, carbon nanotubes, fiillerenes 
or any other nanoscale moiety that may be detected and identified by scanning probe 
microscopy. Nano-barcodes are not limited to single moieties and in certain embodiments 
of the invention a nano-barcode may comprise, for example, two or more fiillerenes 
attached to each other. Fullerenes for example may consist of a series of large and small 
10 fullerenes attached together in a specific order. The order of differently sized fullerenes in 
a nano-barcode may be detected by scanning probe microscopy and used, for example, to 
identify the sequence of an attached oligonucleotide probe. 

[0031] A "target" or "analyte" molecule is any molecule that may bind to a coded probe, 
including but not limited to nucleic acids, proteins, lipids and polysaccharides. In some 
15 embodiments of the invention, binding of a coded probe to a target molecule may be used 
to detect the presence of the target molecule in a sample. 

Molecular Combing 

[0032] In various embodiments of the invention, nano-barcodes, coded probes and/or 
target molecules boimd to coded probes may be attached to a surface and aligned for 

20 analysis. Aligmnent of the coded probes provides for an increased accuracy and/or speed 
of coded probe identification. Coded probes or nano-barcodes that are placed upon a 
surface in a disorganized pattern may overly w ith each other or be partially obscured, 
complicating their detection and/or identification. In some embodiments, coded probes 
may be aligned on a surface and the incorporated nano-barcodes detected as discussed 

25 below. In alternative embodiments, nano-barcodes may be detached from the probe 
molecules, aligned on a surface and detected. In certain embodiments, the order of coded 
probes bound to an individual target molecule may be retained and detected, for example, 
by scanning probe microscopy. In other embodiments, multiple copies of a target 
molecule may be present in a sample and the identity and/or sequence of the target 

30 molecule may be detemiined by assembling all of the sequences of coded probes binding 
to the multiple copies into an overl^ping target molecule sequence. Methods for 

6 
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assembling, for example, overlapping partial nucleic acid or protein sequences into a 
contiguous sequence are known in the art. In various embodiments, nano-barcodes may 
be detected while lliey are attached to probe molecules, or may alternatively be detached 
from the probe molecules before detection. 
5 [0033] Methods and apparatus for attachment to surfaces and alignment of molecules, 
such as nucleic acids, oligonucleotide probes and/or nano-barcodes are known in the art. 
(See, e.g., Bensimon et a/., Phys. Rev. Lett. 74:4754-57, 1995; Michalet et al. Science 
277:1518-23, 1997; U.S. Patent Nos. 5,840,862; 6,054,327; 6,225,055; 6,248,537; 
6,265,153; 6,303,296 and 6,344,319.) Nano-barcodes, coded probes and/or target 

10 molecules may be attached to a surface and aUgned using physical forces inherent in an 
air-water meniscus or other types of interfaces. This technique is generally known as 
molecular combing. Nano-barcodes, coded probes and/or target molecules dissolved in an 
aqueous medium may be attached at either one or both ends to a surface, such as a 
silanized glass slide, a biotinylated surface, a gold-coated surface or any other surface 

15 known in the art capable of binding such molecules. The surface may be slowly 
withdrawn from tiie aqueous medium. Polar or charged target molecules, nano-barcodes, 
and/or coded probe molecules will preferentially partition into the hydrophilic (aqueous) 
medium. Thus, removal of the surface from the aqueous medium results in stretching of 
the bound target molecules, nano-barcodes and/or coded probes, parallel to the direction of 

20 movement of the meniscus. There is a direct correlation between the measured length of 
the stretched molecule and its actual size, with 1 pm of stretched length conresponding to 
about 2,000 bases of nucleic acid sequence (Herrick et aL^ Proc. Natl. Acad. Sci. USA 
97:222-227, 2000). 

[0034] Once the surface has been entirely removed from the aqueous medium, the 
25 attached nano-barcodes and/or coded probes are aligned in a parallel fashion that may be 
more easily and accurately analyzed. In certain embodiments of the invention where both 
ends of a coded probe are attached to the surface, the aligned coded probes will be 
arranged in a U-shaped conformation that is also more easily analyzed. The technique is 
not limited by the size of the target molecules, nano-barcodes and/or coded probes to be 
30 aligned, and can woik on nucleic acids as long as whole chromosomes (e.^., Michalet et 
aL, 1997; Herrick et al, 2000). At appropriate rates of movement of the meniscus the 

7 
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shear forces generated are relatively low, resulting in aligned DNA fragments of several 
himdred kilobases or longer (Michalet et aL, 1997). 

[00351 Molecular combing is inhibited by strong nonspecific adsorption of molecules to 
the treated surface (Bensimon et aL, 1995). Thus, in various embodiments of the 
5 invention, the surface is treated so that only one or more ends of a target molecule or 
coded probe will bind to the surface. Methods for binding nucleic acids and other types of 
coded probes to surfaces are well knovra in the art and are summarized below. Li a non- 
limiting example, target molecules, nano-barcodes or coded probes may be covalently 
modified with biotin residues at one or both ends of the molecule. Upon exposure to an 
10 avidin or streptavidin coated surface, only the biotinylated ends will bind to the surface. 
Nonspecific adsorption to a surface may be decreased by the use of surfaces that are 
hydrophobic in nature, such as silanized surfaces. 

[0036] The embodiments of the invention are not limited by the type of surface that may 
be used. Non-limiting examples of surfaces include glass, fimctionalized glass, ceramic, 

15 plastic, polystyrene, polypropylene, polyethylene, polycarbonate, PTFE 
(polytetrafluoroethylene), PVP (polyvinylpyrrolidone), germanium, silicon, quartz, 
gallium arsenide, gold, silver, nylon, nitrocellulose or any other material known in the art 
that is capable of having target molecules, nano-barcodes and/or coded probes attached to 
the surface. Attachment may be either by covalent or noncovalent interaction. Although 

20 in certain embodiments of the invention the surface is in the form of a glass slide or cover 
sUp, the shape of the surface is not limiting and the surface may be in any shape. In some 
embodiments of the invention, the surface is planar. 

[0037] Altemative methods for aligning target molecules, nano-barcodes and/or coded 
probes on surfaces are known in the art. {Kg,, Bensimon et aL, 1995; Michalet et aly 

25 1997; U.S. Patent Nos. 5,840,862; 6,054,327; 6,225,055; 6,248,537; 6,265,153; 6,303,296 
and 6,344,319). It is contemplated that any known method of ahgnment may be used 
within the scope of the claimed subject matter. In certain ^bodiments of the invention, 
alignment occurs when target molecules, nano-barcodes or coded probes dissolved in an 
aqueous medium are drawn through a moving meniscus. The mechanism by which the 

30 meniscus is moved is not important and may be accomplished, for example, by inmiersing 
a surface in bufifer solution and slowly withdrawing it from the solution. Alternatively, a 
surface may be immersed in a solution and the level of the meniscus may be slowly 

8 



BNSOCX;tD: <W O g QO40aaO37A2 I > 



wo 2004/038037 



PCT/US2003/029726 



lowered by evaporation or by removal of liqmd. In another alternative embodiment of the 
invention, a drop of solution may be placed between a cover slip and a surface, such as a 
glass slide. The surface may be slowly pulled away from the cover slip. Because the 
solution adheres to the cover slip, this results in the formation of an air-water interface at 

5 the edge where the cover sUp contacts the surface. Moving this intCTface aligns the target 
molecules, nano-barcodes and/or coded probes on the surface. Another alternative method 
for aligning nano-barcodes and/or coded probes, discussed in more detail below, involves 
use of free-flow electrophoresis either in place of or during molecular combing. 
Altematively, coded probes and/or nano-barcodes may be aUgned by microfluidic 

1 0 molecular combing, as discussed in the Examples below. 

Nucleic Acids 

[0038] Nucleic acid molecules to be detected, identified and/or sequenced may be 
prepared by any technique known in the art. In certain embodiments of the invention, the 
nucleic acids are n aturally o cciuxing D NA o r RNA m olecules. V irtually any n aturally 

15 occurring nucleic acid may be detected, identified and/or sequenced by the disclosed 
methods including, without limit, chromosomal, mitochondrial and chloroplast DNA and 
ribosomal, transfer, heterogeneous nuclear and messenger RNA. In some embodiments, 
the nucleic acids to be analyzed may be present in cmde homogenates or extracts of cells, 
tissues or organs. In other embodiments, the nucleic acids may be partially or fully 

20 purified before analysis. In alternative embodiments, the nucleic acid molecules to be 
analyzed may be prepared by chemical synthesis or by a wide variety of nucleic acid 
amplification, replication and/or synthetic methods known in the art 

[0039] Methods for p urifying v arious f orms o f c ellular nucleic a cids are known. ( See, 
e.g.. Guide to Molecular Cloning Techniques, eds. Berger and Elinmiel, Academic Press, 

25 New York, NY, 1987; Molecular Cloning: A Laboratory Manual, 2nd Ed., eds. 
Sambrook, Fritsch and Maniatis, Cold Spring Harbor Press, Cold Spring Harbor, NY, 
1989). The methods disclosed in the cited references are exemplary only and any 
variation known in the art may be used. In cases where single stranded DNA (ssDNA) is 
to be analyzed, ssDNA may be prepared fi^om double stranded DNA (dsDNA) by any 

30 known method. Such methods may involve heating dsDNA and allowing the strands to 
separate, or may altematively involve preparation of ssDNA firom dsDNA by known 

9 
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amplification or replication methods, such as cloning into Ml 3. Any such known method 
may be used to prepare ssDNA or ssRNA. 

[0040] Although certain embodiments of the invention concern analysis of naturally 
occurring nucleic acids, virtually any type of nucleic acid could be used. For example, 
5 nucleic acids prepared by various amplification techniques, such as polymerase chain 
reaction (PGR™) amplification, could be analyzed. (See U.S. Patent Nos. 4,683,195, 
4,683,202 and 4,800,159.) Nucleic acids to be analyzed may alternatively be cloned in 
standard vectors, such as plasmids, cosmids, BACs (bacterial artificial chromosomes) or 
YACs (yeast artificial chromosomes). (See, e,g,, Berger and Kimmel, 1987; Sambrook et al, 

10 1989.) Nucleic acid inserts may be isolated fi^om vector DNA, for example, by excision wilh 
appropriate restriction endonucleases, followed by agarose gel electrophoresis. Methods for 
isolation of nucleic acid inserts are known in the art. The disclosed methods are not limited 
as to the source of the nucleic acid to be analyzed and any type of nucleic acid, including 
prokaryotic, bacterial, viral, eukaryotic, mammalian and/or human may be analyzed within 

15 the scope ofthe claimed subject matter. 

[0041] In various embodiments of the invention, multiple copies of a single nucleic acid may 
be analyzed by coded probe hybridization, as discussed below. Preparation of single nucleic 
acids and formation of multiple copies, for example by various amplification and/or 
rq)lication methods, are known in the art A Itematively, a single clone, such as a B AC, 
20 YAC, plasmid, virus, or other vector that contains a single nucleic acid insert may be 
isolated, grown \sp and the insert removed and purified for analysis. Methods for cloning and 
obtaining purified nucleic acid inserts are well known in the art. 

[0042] The s killed a rtisan w ill r ealize t hat t he s cope o f t he c laimed s ubj ect m atta- i s n ot 
limited to analysis of nucleic acids, but also concems analysis of other types of biomolecules, 
25 including but not limited to proteins, hpids and polysaccharides. Methods for preparing 
and/or purifying various types of biomolecules are known in the art and any such method 
may be used. 

Coded Probe Libraries 

[0043] In certain embodiments of the invention, coded probes may comprise a library of 
30 probe molecules, each different probe attached to a distinguishable nano-barcode. Within 
a given library, it is possible that there may be more than one copy of a specific probe 

10 
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molecule. In this case, each copy of the same probe would be attached to aa identical 
nano-barcode. The types of probes and nano-barcodes used are not limiting and any 
known type of probe molecule, including but not limited to oligonucleotides, nucleic 
acids, antibodies, antibody fragments, binding proteins, receptor proteins, peptides, lectins, 
5 substrates, inhibitors, activators, ligands, hormones, cytokines, etc. may be used. Further, 
any type of distinguishable nano-barcode may be used. 

Oligonucleotide Libraries 

[0044] In various embodiments of the invention, the coded probes may comprise 
oligonucleotide probes, such as oligonucleotides of defined sequence. The 

10 oligonucleotides may be attached to distinguishable nano-barcodes, hybridized to a nucleic 
acid to be analyzed and adjacent coded probes ligated together. After separation fix)m the 
nucleic acid, the ligated coded probes may be attached to a surface and aligned, as 
discussed above. The aUgned coded probes may then be analyzed by scanning probe 
microscopy (SPM). SPM analysis allows detection and identification of the nano-barcode 

15 component of coded probes and determination of the sequence of coded probes binding to 
the nucleic acid. That information can be used to identify the nucleic acid and/or to 
determine the nucleic acid sequence. The skilled artisan will realize that the claimed 
subject matter is not limited to SPM detection methods, and any method of analysis that 
can detect and identify nano-barcodes and/or coded probes aligned on a surface may be 

20 used. The skilled artisan will also realize that SPM analysis is not limited to detection and 
identification of oligonucleotide-based coded probes, but may be used with any type of 
coded probe and/or nano-barcode. 

[0045] In alternative embodiments of the invention, coded probes may be detected without 
ligation of adjacent coded probes. The coded probes may be hybridized to multiple copies 

25 of the same target molecule. Non-hybridized coded probes may be removed and the 
hybridized coded probes detected. In some embodiments, coded probes may be detected 
while still hybridized to target molecules. Alternatively, coded probes may be detached 
fix)m the target molecules, for example by heating the sample, and then detected. In such 
embodiments, the nano-barcode component may or may not be removed from the probe 

30 component of the coded probes before detection. 
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[0046] In certain embodiments of the invention, coded probes may be detected while still 
attached to a target molecule. Given the relatively weak strength of the binding interaction 
between short oligonucleotide probes and target nucleic acids, such methods may be more 
appropriate where, for example, coded probes have been covalently attached to the target 
5 molecule using cross-linking reagents, or where the binding interaction between probe 
molecule and target is stronger, as with antibody-antigen interactions. 

[0047] In various embodiments of the invention, oligonucleotide type coded probes may 
be DNA, RNA, or any analog thereof, such as peptide nucleic acid (PNA), which can be 
used to identify a specific complementary sequence in a nucleic acid. In certain 

10 embodiments of the invention one or more coded probe libraries may be prepared for 
hybridization to one or more nucleic acid molecules. For example, a set of coded probes 
containing all 4096 or about 2000 non-complementary 6-mers, or all 16,384 or about 
8,000 non-complementary 7-mers may be used. If non-complementary subsets of 
oligonucleotide coded probes are to be used, a plurality of hybridizations and sequence 

15 analyses may be carried out and the results of the analyses merged into a single data set by 
computational methods. For example, if a library comprising only non-complementary 6- 
mers were used for hybridization and sequence analysis, a second hybridization and 
analysis using the same target nucleic acid molecule hybridized to those coded probe 
sequences excluded from the first library may be performed. 

20 [0048] In s ome embodiments o f t he i nvention, t he c oded p robe 1 ibrary m ay c ontain a 11 
possible sequences for a given oligonucleotide length {e,g,^ a six-mer library would consist 
of 4096 coded probes). In such cases, certain coded probes will form hybrids with 
complementary coded probe sequences. Such hybrids, as well as unhybridized coded 
probes, may be separated from coded probes hybridized to the target molecule using 

25 known methods, such as high performance liquid chromatography (HPLC), gel 
permeation chromatography, gel electrophoresis, ultrafiltration and/or hydroxylapatite 
chromatography. Methods for the selection and generation of complete sets or specific 
subsets o f o ligonucleotides o f a 11 p ossible s equences f or a g iven 1 ength a re k nown. In 
various embodiments, coded probes of 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 

30 18, 19, 20, 21, 22, 23, 24, 25 or more nucleotides in length may be used. 
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[0049] In certain embodiments of the invention, the coded probe libraries may comprise a 
random nucleic acid sequence in the middle of ttie coded probe attached to constant 
nucleic acid sequences at one or both ends. For example, a subset of 12-mer coded probes 
could consist of a complete set of random 8-mer sequences attached to constant 2-mers at 
5 each end. These coded probe libraries can be subdivided according to their constant 
portions and hybridized separately to a nucleic acid, followed by analysis using the 
combined data of each different coded probe library to determine the nucleic acid 
sequence. The skilled artisan will reaUze that the number of sublibraries required is a 
function of the number of constant bases that are attached to the random sequences. An 

10 alternative embodiment may use multiple hybridizations and analyses with a single coded 
probe library containing a specific constant portion attached to random oligonucleotide 
sequences. For any given site on a nucleic acid, it is possible that multiple coded probes 
of different, but overlapping sequence could bind to that site in a slightly of&et manner. 
Thus, using multiple hybridizations and analyses with a single library, a complete 

15 sequence of the nucleic acid could be obtained by compiling the overlapping, offset coded 
probe sequences. 

[0050] In embodiments of the invention involving oligonucleotide libraries, 
oligonucleotides may be prepared by any known method, such as by synthesis on an 
Apphed Biosystems 381 A DNA synthesizer (Foster City, CA) or similar instruments. 

20 Alternatively, oligonucleotides can be purchased from a variety of vendors (e.g., ProUgo, 
Boulder, CO; Midland Certified Reagents, Midland, TX). In embodiments where 
oligonucleotides are chemically synthesized, the nano-barcodes may be covalently 
attached to one or more of the nucleotide precursors used for sjmthesis. Alternatively, the 
nano-barcode may be attached after the oligonucleotide probe has been synthesized. In 

25 other altematives, the nano-barcode(s) may be attached concurrently with oligonucleotide 
synthesis. 

[0051] In certain embodimrats of the invention, coded probes may comprise peptide 
nucleic acids (PNAs). PNAs are a polyamide type of DNA analog with monomeric units 
for adenine (A), guanine (G), thymine (T), and cytosine (C). PNAs are commercially 
30 available from companies such as PE Biosystmis (Foster City, CA). Alternatively, PNA 
synthesis may be perfomied with 9-fluoroenylmethoxycarbonyl (Fmoc) monomer 
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activation and coupling using 0-<7-azabenzotriazol-l"yl)-l,13,3-tetramethyliironiunci 
hexailuorophosphate (HATU) in the presence of a tertiary amine, NJ^- 
diisopropylethylamine (DIEA). PNAs can be purified by reverse phase high performance 
liquid chromatography (RP-HPLC) and verified by matrix assisted laser desorption 
5 ionization - time of flight (MALDI-TOF) mass spectrometry analysis. 

Nano-barcodes 

[0052] Each coded probe may incorporate at least one covalently or non-covalently 
attached nano-barcode. The nano-barcodes may be used to detect and/or identify 
individual coded probes. In certain embodiments of the invention each coded probe may 

10 have two or more attached nano-barcodes, the combination of which is imique to a 
particular coded probe. Combinations of nano-barcodes can be used to expand the number 
of distinguishable nano-barcodes available for specifically identifying a coded probe in a 
library. In other embodiments of the invention, the coded probes may each have a single 
unique nano-barcode attached. The only requirement is that the signed detected &om each 

15 coded probe must be capable of distinguishably identifying that coded probe fi-om 
different coded probes. 

[0053] In certain embodiments of the invention, a nano-barcode may be incorporated into 
a precursor prior to the synthesis of a coded probe. For oligonucleotide-based coded 
probes, internal amino-modifications for covalent attachment at adenine (A) and guanine 

20 (G) positions are contemplated. Intemal attachment may also be performed at a thymine 
(T) position using a commercially available phosphoramidite. In some embodiments 
library segments with a propylamine linker at the A and G positions may be used to attach 
nano-barcodes toe oded probes. The i ntroduction o f an i ntemal aminoalkyl t ail allows 
post-synthetic attachment of the nano-barcode. Linkers may be pxu"chased fi-om vendors 

25 such as Synthetic Genetics (San Diego, CA). In one embodiment of the invention, 
automatic coupling using the appropriate phosphoramidite derivative of the nano-barcode 
is also contemplated. Such nano-barcodes may be coupled to the 5 -terminus during 
oligonucleotide synthesis. 

[0054] In general, nano-barcodes will be covalently attached to the probe in such a 
30 manner as to minimize steric hindrance with the nano-barcodes, in order to facilitate coded 
probe binding to a target moleciile, such as hybridization to a nucleic acid. Linkers may 
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be used that provide a degree of flexibility to the coded probe. Homo-or hetero- 
bifiinctional linkers are available from various commercial sources. 

[00551 The point of attachment to an oligonucleotide base will vary with the base. While 
attachment at any position is possible, in certain embodiments attachment occurs at 
5 positions not involved in hydrogen bonding to the complementary base. Thus, for 
example, attachment can be to the 5 or 6 positions of pyrimidines such as uridine, cytosine 
and thymine. For purines such as adenine and guanine, the linkage is may be via the 8 
position. The claimed methods and compositions are not limited to any particular type of 
probe molecule, such as oligonucleotides. Methods for attachment of nano-barcodes to 
10 other types of probes, such as pqptide, protein and/or antibody probes, are known in the 
art. 

[0056] The embodiments of the invention are not lintiiting as to the type of nano*barcode 
that may be used. It is contemplated that any type of nano-barcode known in the art may 
be used. Non-limiting examples include carbon nanotubes, fuUerenes and submicrometer 
1 5 metallic barcodes. 

Metallic Barcodes 

[0057] Examples of submicrometer metallic barcodes of potential use as nano-barcodes 

are known in the art {e.g., Nicewamer-Pena et al. Science 294:137-141, 2001). 

Nicewamer-Pena et al. (2001) disclose methods of preparing multimetal microrods 
20 encoded with submicrometer stripes, comprised of different types of metal. This system 

allows for the production of a very large nximber of distinguishable nanbarcodes - up to 

4160 using two types of metal and as many as 8 x 10^ with three different types of metal. 

Such nano-barcodes may be incorporated into coded probes and read by SPM technology. 

Methods of attaching metal particles, such as gold or silver, to oligonucleotide and other 
25 types of probe molecules are known in the art U.S. Patent No. 5,472,881). Metallic 

nanobarcodes™ may be obtained from commercial sources {e,g., Nanoplex Technologies, 

Mountain View, CA). 

Quantum Dot Microbeads 

[0058] Nano-barcodes may also comprise quantum dot tagged mio-obeads, as disclosed in 
30 Han et al. {Nature Biotech. 19:631-635, 2001). Midticolor optical coded microbeads were 

created by embedding different sized quantum dots (zin-sulfide-c^ped cadmium selenide 

nanocrystals) into polymeric microbeads at precisely controlled rations. Although the 
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2001 publication concerned use of microbeads for fluorescent tagging and detection, the 
skilled artisan will realize that such beads could also be used in other detection modalities, 
such as SPM imaging. Alternatively, porous silicon photonic crystals, encoded through 
galvanostatic anodic etching, have been proposed (Cunin et al. Nature Materials 1:39-41, 
5 2002). Such micron sized, nanostructured particles may also be of use for SPM detection 
of nano-barcodes. 

Carbon Nanotubes 

[0059] Another exemplary nano-barcode of use in the disclosed methods comprises 
single-walled carbon nanotubes (SWNTs). Nanotubes may be made in a variety of 

10 shapes and sizes that may be distinguished by SPM methods. (See, e.g., Freitag et al^ 
Phys. Rev. B 62:R2307-R2310, 2000; Clauss etal, Europhys. Lett. 47:601-607, 1999; 
Clauss et al, Phys. Rev. B. 58:R4266-4269, 1998; Odom et al, Ann.N.Y. Acad. Sci, 
960:203-215, 2002). Odom et al (2002) disclose an STM (scanning tunneling 
microscope) technique that is capable of detecting discrete peaks in the timneling spectra 

15 of SWNTs of 10 mn or less in size. Such peaks may represent van Hove singularities in 
the density of electronic states (DOS) of the carbon nanotubes. 

[0060] The electronic properties of carbon nanotubes are modulated by the length of the 
tube. The sensitivity of the electronic wavefimction to length is illustrated by an estimate 
for the energy level splitting of a tube of length L. 

20 AE =;ivF/2L (Eq. 1) 

Where h is Planck's constant and vF is the Fermi velocity (8.1 x 10^ m/sec) (Venema et 
al, "Imaging Electron Wave Functions of Carbon Nanotubes," Los Alamos Physics 
Pr^rints:cond-mat/9811317, 23 Nov. 1996.) The difference between electron energy 
levels is inversely proportional to the length of the nanotube, with jSner splitting observed 
25 for longer tubes. 

[0061] For certain embodiments of the invention, nanotubes to be used as nano-barcodes 
may have tube lengflis of about 10 to 200 nm and a diameter of about 1.2 to 1 .4 nm. The 
length or diameter of the nanotubes to be used as nano-barcodes is not limited and 
nanotubes of vntually any length or diameter are contemplated 
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[0062] It is contemplated that nanotubes may be prepared by known methods or obtained 
from commercial sources, for example, CarboLex (Lexington, KY), NanoLab 
(Watertown, MA), Materials and Electrochemical Research (Tucson, AZ) or Carbon Nano 
Technologies Inc. ( Houston, T X). S ome p rocessing o f e ither s ynthesized o r p urchased 
5 nanotubes may be appropriate before use. Processing may include purification of 
nanotubes from other contaminants, separation of nanotubes of mixed diameter and/or 
length into nanotubes of discrete diameter and length, removal of nanotube end caps 
and/or covalent modification to facilitate attachment of the nanotube to a probe to fomi a 
coded probe. 

10 [0063] In certain embodiments of the invention, carbon nanotubes of varying length 
and/or diameter may be produced by a variety of techniques known in the art, including 
but not limited to carbon-arc discharge, chemical vapor deposition via catalytic pyrolysis 
of hydrocarbons, plasma assisted chemical vapor deposition, laser ablation of a catalytic 
metal-containing graphite target, or condensed-phase electrolysis. (See, e.g.^ U.S. Patent 

15 Nos, 6,258,401, 6,283,812 and 6,297,592.) In some embodiments, nanotubes may be size 
sorted by mass spectrometry (See, Parker et al, J. Am. Chem. Soc. 113:7499-7503, 1991). 
Alternatively, nanotubes may be sorted using an AFM (atomic force microscope) or STM 
(scanning tunneling microscope) to precisely measure the geometry of individual 
nanotubes before incorporating them into coded probes. Other methods of size 

20 fi^ctionation known in the art, such as gas chromatography, time of flight mass 
spectrometry, ultrafiltration or equivalent techniques are contemplated. Once sorted, the 
carbon nanotubes may be derivatized and covalently attached to ohgonucleotide probes of 
known sequence or any other type of probe. 

[0064] The minimum incremental change in tube length possible for a caibon nanotube is 
25 the length of the carbon-carbon bond, or about 0.142 nm. With a range of tube lengths of 
200 nm, this would allow for about 1400 discrete nano-barcodes. However, the method is 
not Ihnited to a single nanotube per coded probe. In altemative embodiments, multiple 
nanotubes of different length and diameter may be attached to a single coded probe. 
Using combinations of nanotubes of different length, the number of possible 
30 distinguishable nano-barcodes increases exponentially. In some embodiments, a single 
nanotube may be attached to a single probe molecule for simplicity of analysis. 
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[0065] Other embodiments of the invention concern methods of producing carbon 
nanotubes of defined length and diameter. In a non-limiting exemplary ^bodiment, a 
chip may contain a layer of SiC of preselected thickness, overlaying a layer composed, for 
example, of silicon or silicon doped with catalysts (e.g. metal atoms such as nickel). 
Using standard chip processing methods, such as photoUthography and etching or laser 
ablation, the SiC layer may be divided into SiC deposits of any length, width, thickness 
and shape. Subsequently the chip may be heated under a vacuum, for example at about 
10'^ Torr at about MOO^C, or alternatively from about 10'^ to 10''^ Torr, lO"* to 10"*^ Torr, 
or 10"^ to 10'^ Torr, and from 1200 to 2200°C or 1400 to 2000°C. Under these conditions, 
SiC crystals spontaneously decompose and lose siUcon atoms (U.S. Patent No. 6,303,094). 
The remaining carbon atoms spontaneously assemble into carbon nanotubes. The size and 
shape of the SiC deposits may be precisely controlled to produce carbon nanotubes of any 
length and diameter. 

[0066] The exemplary embodiments of the invention discussed above are not limiting and 
any method of producing carbon nanotubes of selected length and diameter may be used 
(e.^., U.S. Patent Nos. 6,258,401; 6,283,812 and 6,297,592). hi some embodiments, 
nanotube length may be adjusted by using a laser beam, electron beam, ion beam or gas 
plasma beam to trim the ends. Altematively, the ends of the nanotubes could be brought 
into contact with a hot blade in an oxygen-containing atmosphere to oxidatively remove 
the ends of the tubes. A block containing the nanotubes could also be sectioned or 
polished to truncate the nanotubes. 

[0067] In certain embodiments of the invention, carbon nanotubes may be derivatized 
with reactive groups to facilitate attachment to probe molecules. In a non-lintiiting 
example, nanotubes may be derivatized to contain carboxylic acid groups (U.S. Patent No. 
6, 1 87,823). C aiboxylate d erivatized n anotubes m ay b e a ttached top robe m olecules b y 
standard chemistries, for example by carbodiimide mediated formation of an amide 
linkage with a primary or secondary amine group located on the probe. The methods of 
derivatization and cross-linking are not Umiting and any reactive group or cross-linking 
methods known in the art may be used. 
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Fullerenes 

[0068] In alternative embodiments of the invention, fullerenes may be used to as nano- 
barcodes. Methods of producing fullerenes are well known {e,g., U.S. Patent No. 
6,358,375). Fullerenes may be derivatized and attached to probe molecules by methods 
5 similar to those disclosed above for carbon nanotubes. FuUerene-containing coded probes 
may be identified by SPM technologies, similar to those disclosed above for nanotubes. 

[0069] In certain embodiments of the invention, fiillerenes may be attached to individual 
nucleotides in an oligonucleotide coded probe. In such case, only two differCTt types of 
distinguishable fullerenes are required, as there are only four types of nucleotide found in 
10 an oligonucleotide and two types of fullerenes may be combined in four different 
combinations (e.g., AA, BB, AB and BA). Where individual nucleotides are attached to 
nano-barcodes, it may be appropriate to use known linking groups between tiiie nucleotide 
and the fixllerene to avoid steric hindrance with hybridization to target nucleic acids. 

[0070] The skilled artisan will realize that nano-barcodes of use in the disclosed methods 
15 are not lunited to the raibodiments disclosed herein, but may include any other type of 
knoAvn nano-barcode that may be attached to a probe and detected. Other non-limiting 
examples of nano-barcodes of potential use include quantum dots (e.g., Schoenfeld, et aL, 
Proc. 7th Int. Conf. on Modulated Semiconductor Structures, Madrid, pp. 605-608, 1995; 
Zhao, et aL, 1st Int. Conf on Low Dimensional Structures and Devices, Singapore, pp. 
20 467-471, 1995). Quantum dots and otiier types of nano-barcodes may be synthesized by 
known methods and/or obtained firom commercial sources (e.^.. Quantum Dot Corp., 
Hayward, CA). Other nano-barcodes of potential use include nanoparticles, available, for 
example, jfrom Nanoprobes Inc. (Yaphank, NY) and Polysciences, Inc. (Warrington, PA). 

Hybridization and Ligation of Oligonucleotide-Based Coded Probes 

25 [0071] In various embodiments of the invention, hybridization of a target nucleic acid to 
an oligonucleotide-based coded probe library may occur under stringent conditions that 
only allow hybridization between fiilly complementary nucleic acid sequences. Low 
stringency hybridization is generally performed at 0.15 M to 0.9 M NaCl at a temperature 
range of 20*^0 to 50*^0. High stringency hybridization is generally performed at 0.02 M to 

30 0. 1 5 M NaCl at a temperature range of SO^'C to 70''C. It is understood that the temperature 
and/or ionic strength of an appropriate stringency are determined in part by the length of 
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an o ligonucleotide p robe, t he b ase c ontent o f t he t arget s equences, a nd the p resence o f 
formamide, tetramethylanunoniiim chloride or other solvents in the hybridization mixture. 
The ranges mentioned above are exemplary and the appropriate stringency for a particular 
hybridization reaction is often determined empirically by comparison to positive and/or 
5 negative controls. The person of ordinary skill in the art is able to routinely adjust 
hybridization conditions to allow for only stringent hybridization between exactly 
complementary nucleic acid sequences to occur. 

[0072] Once short coded probes have been hybridized to a nucleic acid, adjacent coded 
probes may be ligated together using known methods (see, e.g., U.S. Patent Nos. 

10 6,013,456). Oligonucleotide sequences of as short as 6 to 8 bases may be efBciently 
hybridized to target nucleic acids (U.S. Patent No. 6,013,456). Primer independent 
Ugation may be accomplished using oligonucleotides of at least 6 to 8 bases in length 
(Kaczorowski and Szybalski, Gene 179:189-193, 1996; Kotler et aL, Proc. Natl. Acad. 
Sci. USA 90:4241-45, 1993). Methods of ligating oligonucleotide coded probes that are 

15 hybridized to a nucleic acid template are known in the art (U.S. Patent No. 6,013,456). 
Enzymatic ligation of adjacent oUgonucleotide c oded probes may utilize a DNA ligase, 
such as T4, T7 or Taq ligase or E. coli DNA ligase. Methods of enzymatic Hgation are 
known {e.g, , Sambrook et al , 1 989). 

Immobilization of Molecules 

20 [0073] In various embodiments of the invention, the target molecules to be analyzed may 
be immobilized prior to, subsequent to and/or during coded probe binding. For example, 
target molecule immobilization may be used to facilitate separation of boimd coded probes 
from imbound coded probes. In certain embodiments, target molecule iromobilization 
may also be used to separate bound coded probes from the target molecules before coded 

25 probe detection and/or identification. Although the following discussion is directed 
towards immobilization of nucleic acids, the skilled artisan will realize that methods of 
immobilizing various types of biomolecules are known in the art and may be used in the 
claimed methods. 

(0074] Nucleic acid immobilization may be used, for example, to facilitate separation of 
30 target nucleic acids from ligated coded probes and from unhybridized coded probes or 
coded probes hybridized to each other. In a non-limiting example, target nucleic acids 
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may be immobilized and allowed to hybridize to coded probes, after which hybridized 
adjacent coded probes are ligated togeflier. The substrate containing bound nucleic acids 
is extensively washed to remove unhybridized coded probes and coded probes hybridized 
to other coded probes. Following washing, the hybridized and Ugated coded probes may 
5 be removed from tiie immobilized target nucleic acids by heating to about 90 to 95^C for 
several minutes. The ligated coded probes may be attached to a surface and aligned by 
molecular combing, as disclosed above. The aligned coded probes may then be analyzed 
by SPM. 

[0075] hnmobilization of nucleic acids may be achieved by a variety of methods known in 
10 the art. hi an exemplary embodiment of the invention, immobilization may be achieved 
by coating a substrate with streptavidin or avidin and the subsequent attachment of a 
biotinylated nucleic acid OHobnstrom et al. Anal Biochem. 209:278-283, 1993). 
Immobilization may also occur by coating a silicon, glass or other substrate witii poly-L- 
Lys (lysine), followed by covalent attachment of either amino- or suUhydryl-modified 
15 nucleic acids using bifimctional crosslinking reagoits (Running et al, BioTechniques 
^ilie-m, 1990; Newton et al. Nucleic Acids Res, 21:1155-62, 1993). Amine residues 
may be introduced onto a substrate tiurough the use of aminosilane for cross-linking. 

[0076] Immobilization may take place by direct covalent attachment of 5 -phosphorylated 
nucleic acids to chemically modified substrates (Rasmussen et al. Anal Biochem. 

20 198:138-142, 1991). The covalent bond between the nucleic acid and the substrate is 
formed by condensation with a water-soluble carbodiimide or other cross-linking reagent. 
This method facilitates a predominantly 5*-attachment of the nucleic acids via their 5 - 
phosphates. Exemplary modified substrates would include a glass slide or cover sUp tiiat 
has been treated in an acid bath, exposing SiOH groups on the glass (U.S. Patent No. 

25 5,840,862). 

[0077] DNA is commonly boimd to glass by first silanizing the glass substrate, then 
activating with carbodiimide or glutaraldehyde. Altemative procedures may use reagents 
such as 3-glycidoxypropyltrimethoxysilane (GOP), vinyl silane or 
aminopropyltrimethoxysilane (APTS) with DNA linked via amino linkers incorporated 
30 either at the 3' or 5' end of the molecule. DNA may be bound directly to membrane 
substrates using ultraviolet radiation. Other non-limiting examples of immobilization 
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techniques for nucleic acids are disclosed in U.S. Patent Nos. 5,610,287, SJ76,674 and 
6,225,068. Conunercially available substrates for nucleic acid binding are available, such 
as Covalink, Costar, Estapor, Bangs and Dynal. The skilled artisan will realize that the 
disclosed methods are not limited to immobiUzation of nucleic acids and are also of 
potential use, for example, to attach one or both ends of oligonucleotide coded probes to a 
substrate. 

[0078] The type of substrate to be used for immobilization of the nucleic acid or other 
target molecule is not limiting. Jn various embodiments of the invention, the 
immobilization substrate may be magnetic beads, non-magnetic beads, a planar substrate 
or any other conformation of sohd substrate comprising almost any material. Non-limiting 
examples of substrates that may be used include glass, silica, silicate, PDMS (poly 
dimethyl siloxane), silver or other metal coated substrates, nitrocellulose, nylon, activated 
quartz, activated glass, polyvinylidene difluoride (PVDF), polystyrene, polyacrylamide, 
other polymers such as poly(vinyl chloride) or poly(methyl methacrylate), and 
photopolymers which contain photoreactive species such as nitrenes, carbenes and ketyl 
radicals capable of forming covalent links with nucleic acid molecules (See U.S. Pat. Nos. 
5,405,766 and 5,986,076). 

[0079] Bifimctional cross-linking reagents may be of use in various embodiments of the 
invention. The bifunctional cross-linking reagents can be divided according to the 
specificity of their functional groups, e,g., amino, guanidino, indole, or carboxyl specific 
groups. Of these, reagents directed to free amino groups are popular because of their 
commercial availability, ease of synthesis and the mild reaction conditions under which 
they can be applied. Exemplary methods for cross-linking molecules are disclosed in U.S. 
Patent Nos. 5,603,872 and 5,401,511. Cross-linking reagents include glutaraldehyde 
(GAD), bifunctional oxirane (OXR), ethylene glycol diglycidyl ether (EGDE), and 
carbodiimides, such as l-ethyl-3-(3-dimethylaminopropyl) carbodiimide (EDC). 
Scanning Probe Microscopy 

[0080] Scanning probe microscopes (SPM) are a family o f instruments that are used to 
measure the physical properties of objects on a micrometer and/or nanometer scale. 
Different modalities of SPM technology are available, discussed in more detail below. 
Any modality of SPM analysis may be used for coded probe detection and/or 
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identification. In general, an SPM instrument uses a very small, pointed probe in very 
close proximity to a surface to measure the properties of objects. In some types of SPM 
instruments, the probe may be mounted on a cantilever that may be a few hundred microns 
in length and between about 0.5 and 5.0 microns thick. Typically, the probe tip is raster- 
5 scanned across a surface in an xy pattem to map localized variations in surface properties. 
SPM methods of use for imaging biomolecules and/or detecting molecules of use as nano- 
barcodes are known in the art (e.g., Wang et aL, Amer. Chem.Soc, Lett., 12:1697-98. 
1996; Kim et al, Appl. Surface Sci. 130, 230, 340-132:602-609, 1998; Kobayashi et al, 
Appl. Surface Sci. 157:228-32, 2000; Hirahara et al, Phys. Rev. Lett. 85:5384-87, 2000; 
10 IQein et al. Applied Phys. Lett. 78:2396-98, 2001; Huang et al. Science 291:630-33, 
2001; Ando et al, Proc. Natl. Acad. Sci. USA 12468-72, 2001). 

Scanning Tunneling Microscopy (STM) 

[0081] Scanning tunneling microscopy was the first SPM technique developed in the early 
1980's. STM reUes on the existence of quantum mechanical electron tunneling between 

15 the probe tip and sample surface. The tip is sharpened to a single atom point and is raster 
scanned across tiie surface, maintaining a probe-surface gap distance of a few angstroms 
without actually contacting the surface. A small electrical voltage difference (on the order 
of millivolts to a few volts) is sqiplied between the probe tip and sample and the tunneling 
current between tip and sample is determined. As the tip scans across the surfaces, 

20 differences in the electrical and topographic properties of the sample cause variations in 
the amount o f tunneling current. In certain embodiments o f the invention, the relative 
height of the tip may be controlled by piezoelectric elements with feed-back control, 
interfaced with a computer. The computer can monitor the current intensity in real time 
and move the tip up or down to maintain a relatively constant current In different 

25 embodiments, the height of the tip and/or current intensity may be processed by the 
computer to develop an image of the scanned sur&ce. 

[0082] Because STM measures the electrical properties of the sample as well as the 
sample topography, it is capable of distinguishing between different types of conductive 
material, such as different types of metal in a metal barcode. STM is also capable of 
30 measuring local electron daisity. Because the tunneling conductance is proportional to the 
local density of states (DOS), STM can also be used to distinguish carbon nanotubes that 
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vary in their electronic properties depending on the diameter and length of the nanotube. 
STM may be used to detect and/or identify any nano-barcodes that differ in their electrical 
properties. 

[0083] An STM probe tip may be scanned across a surface containing aligned coded 
5 probes to detect and identify each coded probe on the surface. Ligated coded probes may 
also be identified. Target molecules may be identified by determining which coded probes 
bind to the target molecule. In embodiments of the invention where the coded probes 
indicate the presence of specific sequences (such as oligonucleotide sequences), the 
sequence of the biomolecule may be determined firom the sequence of the coded probes 
1 0 that bind to the target molecule. 

Atomic Force Microscopy 

[0084] Another modality of SPM is atomic force microscopy (AFM). Methods of 
biomolecule analysis by AFM are generally known in the art (eg., Uchihashi et aL, 
"Application of Noncontact-Mode Atomic Force Microscopy to Molecular Imaging," 
15 httpy/www.foresightorg/Conferences/MNT7/Abstracts/Uchihashi). In AFM microscopy, 
the probe is attached to a spring-loaded or flexible cantilever that is in contact with the 
surface to be analyzed. Contact is made within the molecular force range (i.e., within the 
range of interaction of Van der Waal forces). Within AFM, different modes of operation 
are possible, including contact mode, non-contact mode and TappingMode™. 

20 [0085] In contact mode, the atomic force between probe tip and sample surface is 
measured by keeping the tip-sample distance constant and measuring the deflection of the 
cantilever, typically by reflecting a laser off the cantilever onto a position sensitive 
detector. Cantilever deflection results in a change in position of the reflected laser beam. 
As in STM, the height of the probe tip may be computer controlled using piezoelectric 

25 elements with feedback control. In some embodiments of the invention a relatively 
constant degree of deflection is maintained by raising or lowering the probe tip. Because 
the probe tip may be in actual (Van der Waal) contact with the sample, contact mode AFM 
tends to defomi non-rigid samples. In non-contact mode, the tip is maintained between 
about 5 0 to 1 50 angstrom above the sample surface and the tip i s o scillated. V an der 

30 Waals interactions between the tip and sample surface are reflected in changes in the 
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phase, amplitude or frequency of tip oscillation. The resolution achieved in non-contact 
mode is relatively low. 

[0086] In T appingMode™, t he cantilever i s o scillated a t o r n ear i ts r esonant f requency 
using piezoelectric elements. The AFM tip periodically contacts (taps) the sample surface, 
5 at a frequency of about 50,000 to 500,000 cycles per second in air and a lower frequency 
in liquids. As the tip begins to contact the sample sxutface, the amplitude of the oscillation 
decreases. Changes in amplitude are used to determine topographic properties of the 
sample. Because AFM analysis does not depend on electrical conductance, it may be used 
to analyze the topological properties of non-conductive materials. Certain types of nano- 
10 barcodes, including but not limited to carbon nanotubes, fiillarenes and nanoparticles, that 
differ in their topological properties may be detected and/or identified by AFM techniques. 

[00871 In alternative modes of AFM, additional information may be obtained besides the 
topological profile of the sample. For example, in lateral force microscopy (LFM), the 
probe is scanned perpendicular to its length and the degree of torsion of the cantilever is 
15 determined. Cantilever torsion will be dependent on the fiictional characteristics of the 
surface. Since the fiictional characteristics of coded probes may vary depending on their 
composition, LFM may be usefiil to detect and identify different coded probes. 

[0088] Another variation is chemical force microscopy (CFM), in which the probe tip is 
functionalized with a chemical species and scanned over a sample to detect adhesion 

20 forces between the chemical species and the sample (e.g., Frisbie et al^ Science 265:2071- 
2074, 1994). Chemicals with differing affinities for nano-barcode materials, such as gold 
or silvCT, may be incorporated into an AFM probe tip and scaitmed aaross a surface to 
detect and identify nano-barcodes. Another SPM mode of potential use is force 
modulation imaging (Maivald et al, Nanotechnology 2:103, 1991), Uchihashi et al 

25 (http://www.foresight.org/Conferences/MNT7/Abstracts/Uchihashi) disclose a method of 
biomolecule imaging using frequency modulation in non-contact mode AFM. 

[0089] Other SPM modes tiiat may potentially be used to detect and/or identify coded 
probes include magnetic force microscopy (MFM), high frequency MFM, 
magnetoresistive sensitivity mapping ^SM), electric force microscopy (EFM), scanning 
30 cE^acitance microscopy (SCM), scanning spreading resistance microscopy (SSRM), 
tunneling AFM and conductive AFM. Li certain of these modalities, magnetic properties 
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of a sample may be detemiined. The skilled artisan will realize that metal barcodes and 
other types of nano-barcodes may be designed that are identifiable by their magnetic as 
well as by electrical properties. 

[0090] SPM instruments of use for coded probe detection and/or identification are 
5 commercially available (e.g, Veeco Instruments, Inc., Plainview, NY; Digital Instruments, 
Oakland, CA). Alternatively, custom designed SPM instruments may be used. 

Nano-barcodes and Scanning Probe Microscopy 

[00911 Exemplary embodiments of the invention are illustrated in FIG. 1 through FIG. 4. 
FIG. lA and FIG. IB iUustrate a non-limiting method for aligning coded probes 130 on a 
10 surface 100. A surface 100, for example a glass microscope slide 100 that has been coated 
with streptavidin by known methods, is immersed in a solution 110 containing, for 
example, biotinylated coded probes 130. The solution 110 may be contained in a 
container 120. 

[0092] In a non-limiting example, the coded probes 130 comprise oligonucleotide probes 
15 that have been hybridized to a target nucleic acid molecule. The nucleic acid molecule 
may be immobilized by attachment to a nylon membrane, 96-well microtiter plate or other 
immobilization substrate. Biotinylated oligonucleotides comprising, for example, all 4096 
possible 6-mer sequences may be obtained from commercial sources (e.g.^ Midland 
Certified Reagents, Midland, TX). The biotinylated oUgonucleotides may be attached, for 
20 example, to submicrometer nietaUic barcodes (Nicewamer-Pena et al, 2001) to form 
coded probes 130. The coded probes 130 are allowed to hybridize to a target nucleic acid. 
After hybridization, adjacent coded probes 130 are ligated together using ligase. 
Unhybridized coded probes 130 and coded probes 130 hybridized to each other are 
removed by extensive washing, leaving only coded probes 130 that are hybridized to the 
25 nucleic acid. The coded probes 130 are removed by heating the solution 1 10 to 95*C for 
five minutes. The nucleic acid attached to the immobilization substrate is removed, 
leaving only Ugated coded probes in solution 110. 

[0093] The biotinylated coded probes 130 remaining in solution 110 attach at one end to 
the streptavidin coated surface 100. The surface 100 is slowly removed from the solution 
30 110. A Itematively, 1 iquid from the s olution 1 10 i s s lowly removed from t he c ontainer 
120, for example by evaporation or slow pumping. As the meniscus of the air-water 
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interface slowly moves across the surface 100, fee attached coded probes 130 are aligned 
on the surface 100. The aligned coded probes 130 may be analyzed by AEM, STM or 
other scanning probe methods. 

[0094] Another exemplaiy embodiment of the invention is illustrated in FIG. 2. A drop of 
5 solution 210 containing coded probes 230 is placed on a surface 220, such as a glass slide. 
In certain embodiments, the slide 220 may be treated as disclosed above to bind one or 
both ends of the coded probes 230. The drop 210 is sandwiched between the surface 220 
and a glass cover slip 200. In various embodiments, the cover slip 200 may be held in a 
constant position while the surface 220 is slowly pulled away from the cover slip 200. 
10 This creates a meniscus at the edge of the cover slip 200 that serves to align the coded 
probes 230. 

[0095] In various embodiments of the invention, the coded probes 130, 230 may be 
attached to a surface 100, 220 at both ends rather than at one end. In this case, alignment 
of the coded probes 130, 230 would result in a U-shaped molecule, instead of a linearized 
15 molecule (e.g. U.S. Patent No. 5,840,862). The exemplary embodiments illustrated in 
FIG. 1 and FIG. 2 can also be performed by attaching both ends of the coded probes 130, 
230 to the surface 100, 220 (not shown). 

[0096] In another exemplary embodiment, illustrated in FIG. 3, coded probes 340 may be 
aligned o n a s urface 3 00 b y free flow e lectrophoresis. T he s ur&ce 3 00 m ay comprise 

20 alternating bands of conductive and non-conductive materials, such as strips of gold film 
310 coated onto a glass sheet 320. In the presence of an alternating current electrical field 
330, coded probes 340 comprising charged residues, such as the phosphate groups on 
oUgonucleotides, will align with the field 330. Free flow electrophoresis may be used in 
addition to or instead of molecular combing to align coded probes 340 on a surface 300. 

25 Mettiods of performing free flow electrophoresis are known (e.g,, Adjari and Prost, Proc. 
Natl. Acad. Sci. U.S.A. 88:4468-71, 1991). However, the present application presents the 
first use of free flow electrophoresis for aligning molecules on a surface. 
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EXAMPLES 

Example 1 : Nano-tag Elements 

[0097] Table 1 shows an exemplary list of nano-tag elements whose molecular structures 
may be of use for nano-barcode production. Most of the listed elements have nanometer- 
5 sized features. The nano-tag elements are grouped by parent stmctures and are divided 
into fullerene molecules, POSS ^lyhedral oligomeric silsesquioxane) molecules, and 
organo-metallic compounds. POSS molecules are a relatively new type of hybrid 
structure, based on the silsesquioxane cage stracture. Fullerene and POSS molecules are 
nearly symmetric three-dimensional structures, whereas the organometallic compounds are 

10 either planar or non-symmetric three-dimensional structures. Fullerenes exhibit relatively 
low solubility in any solvent. POSS molecules are used as additives in plastic polymers 
and tend to exhibit aggregation over individixal deposition. Organometallic compounds 
have t he a dvantage o f enormous d iversity, d ue t o t he c ombinatorial p airing o f m etallic 
centers with o rganic c oimterparts. hi a ddition, the t hree-dimensional d iversity oft hese 

15 molecules ranges from 3D asyrometric to 2D symmetric. Many of the organometallic 
compounds can be bi-fimctionalized for downstream processing of elements into barcodes. 
Organometallic molecules have been subjects of numerous imaging studies and have 
features that are readily observed by molecular imaging. The skilled artisan will realize 
that the nano-tag elements listed in Table 1 are exemplary only. A wide variety of nano- 

20 tag elements are known in the art and may be used to make nano-barcodes, including but 
not limited to quantum dots and carbon nanotubes. Any such known nano-tag element 
may be used to produce nano-barcodes and coded probes. 

Example 2. Synthetic Schemes 

Bifunctional Intermediates 

25 [00981 FIG. 5 illustrates an exemplary scheme for nano-tag-mediated barcode synthesis, 
involving p roduction o f b i-functionalized n ano-tag e lements t hat a re u sed a s a b uilding 
block f or a c ontrolled s tepwise h ead-to-tail a ssembly o f i ndividual u nits i nto a s pecific 
polymer sequence. This approach has been used in molecular biochemistry to make 
peptides and oligonucleotides on automated soUd phase synthesizers. FIG. 5A shows the 

30 initial conversion of an exemplary nano-tag element into a bi-functional molecule. Two 
functional moieties (Rl and R2) are shown attached to opposite ends of the tag element. 
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FIG. 5B illustrates the selective and transient protection of one group with activation of 
the other functional group. Such techniques are well known, for example, in solid phase 
peptide synthesis. FIG. 5C illustrates the stepwise addition of building blocks in a 
controlled polymerization. Typically, the transient protecting group is removed from the 
5 terminus of the growing polymer and the next building block is coupled to the newly 
deprotected terminus. Multiple cycles give rise to increasing length of barcode polymer. 

[0099] Exemplary Rl functional groups include CH2OH and CONHC3H6NH2. 
Exemplary R2 functional groups include CH2OH and COOH. Where a CH2OH is used as 
the Rl and R2 groups, the Rl group may be protected with a dimethoxytrityl moiety, 

10 which may be rraaoved by acid treatment. The R2 group may then be activated by 
cyanoethyl-N,N-diisopropyl phosphoramidite. Where the Rl group is CONHC3H6NH2 
and the R2 group is COOH, the Rl group may be protected by a trityl moiety, which may 
be removed by acid treatment. The R2 group may be activated, for example, by 
carbodiimide treatment. Ottier protection/deprotection chemistries are well known, for 

15 example in solid phase peptide or oligonucleotide synthesis and any such known mettiods 
may be utilized for nano-barcode production. 

Backbone Mediated Synthesis 

[0100] Backbone mediated nano-barcode synthesis is modeled after standard peptide or 
oligonucleotide solid phase synthesis. The nano-tag element is converted into a mono- 

20 fimctionalized analog and tihen attached to eiflier an amino acid or to a nucleotide 
phosphoramidite. The building blocks would be appropriately blocked and activated for 
standard automated solid phase synthesis. The overall scheme is illustrated in FIG. 6. The 
tag unit is initially monofunctionalized by addition of an ^propriate R groi^ (FIG. 6A). 
Using either peptide or oligonucleotide based polymerization, the fimctionalized tag group 

25 is converted to a covalently tagged amino acid subvmit (FIG. 6B) or nucleotide subunit 
(FIG. 6C). 

[0101] One consideration in such a scheme is the choice of backbone and the known 
physical and chemical properties of naturally occurring polymeric molecules, such as 
polypeptides and oligonucleotides. Chemical attachment of tag el^nents to amino acids or 
30 oligonucleotide analogs should be of equal difficulty. The phosphoramidite 
polymerization chemistry is 1 0 f old m ore robust than p eptide chemistry, and w ould b e 
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more compatible with the down stream synthesis of coded probes. However, peptides can 
give rise to secondary structures such as the alpha helix that could provide structural 
entropy to the coded probes. Table 2 summarizes candidates based on commercially 
available starting products. The skilled artisan will realize that the listed candidate 
5 subunits are exemplary only and that a wide variety of other potential subunits are known 
in the art and may be utilized. 

Polymer Decoration With Nano-tag Elements 

[0102] Another altemative approach to coded probe synthesis entails creating polymer 
scaffolds to which nano-tag elements are attached through post-polymer assembly. For 

10 example, peptides and oligonucleotides provide linear scaffold molecules to which nano- 
tag elements may b e attached post-assembly. Advantageously, methods o f peptide and 
oUgonucleotide production are well known in the art. However, other forms of nano- 
structures may provide multi-dimensional scaffolds. A difficulty with this approach is that 
it is difficult to put more than one kind of tag element (not including spacers) into the 

15 polymer. The high specificity for protection/deprotection also limits such schemes. Steric 
hindrance may also prevent complete decoration. This process is the least difficult for 
creating exemplary of coded probes, as the polymer synthesis part of the scheme is well 
characterized and methods of post-translational modification of peptides and 
oligonucleotides are known. A non-limiting example of a coded probe based on 

20 oligonucleotides is provided in the Examples below. The branch points on the exemplary 
oligonucleotide based coded probe may be detected by SPM techniques, or may serve as 
attachment sites for nanoparticles or other types of nano-tag elements. 

[0103] A peptide or oligonucleotide that has active groups at specific appropriately spaced 
sites may be purchased from commercial sources. The polymer may then be exposed to a 
25 mono-functionalized nano-tag element and all of the active sites would be modified. 
Depending on the strategy, solid phase chemistry techniques may be used for decorating 
Avith nano-tag elements, preventing unwanted intermolecular polymerization. 

[0104] Table 3 lists exemplary mono-functionalized tag elements and their related 
polymers for decoration. All components listed are currently commercially available and 
30 the decoration chemistry is known. Complete labeling and solubility are important. As 
the molecules are decorated, their solubihty is affected, often leading to precipitation. 
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Also, the structures are subject to secondary and tertiary structural properties, giving rise 
to complex folding patterns. Folding may be affected by deposition onto a flat surface. 
Thus, a more rigid backbone may exhibit certain advantages. The skilled artisan will 
realize ttiat the listed functionalized nano-tag elements are not limiting and that a variety 
of other functionalized nano-tag elements, such as quantum dots or carbon nanotubes, are 
known and may be used. 

Direct read of polymer subunits 

[0105] A fourth strategy is based on STM imaging of charge densities of certain amino 
acids or nucleotide analogs. This approach could be accompHshed by commercially 
obtaining peptide or oligonucleotide synthesis of specified sequences, followed by 
spotting a nd i maging. Table 4 1 ists s everal e XCTaplary s ubunits a nd t heir i ncorporation 
into a polymer sequence. Secondary structures and imaging presentation may be 
considered when designing these polymers. 

[0106] Exemplary coded probe subunits were and their characteristics were determined, as 
disclosed in the following Examples. 

Example 3. Synthesis of Exemplary Coded Probe Subunits 

Peptide Polymers 

[0107] An exemplary peptide polymer, of potential use for production of either a 
decorated polymer or direct polymer imaging, was prepared (SEQ ID NOrl). A 5 mg 
scale solid-phase peptide ^utiiesis was performed. The resulting peptide was HPLC 
purified to about 98% purity. Mass spectroscopy was used to demonstrate the presence of 
flie fiiU length product The carboxyl terminal end of the peptide was modified to form an 
amide terminal group. 

AAMAAKAMAAMAKAVAMAAKAVAAMAKAAA (SEQ ID NO:l) 

[0108] The sequence was predicted to be an alpha helix, based on sequence similarity to 
kCTatins, Rop protein and poly-alanine, with the amino terminus and the secondary amines 
fix>m lysine facing the same side of the helix. These amines make excell^^t attachment 
sites for any molecule mono-fimctionaUzed with an activated carboxyl group, using 
standard dicyclohexylcarbodiimide (DCC) or water soluble carbodiimide cross-linking. 
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The amide blocking group on the carboxyl terminus was used to prevent polymerization of 
peptides to each other. 

[0109] Potential quaternary structure formation into helical bundles (e,g,, 4-helix bundle) 
may be eliminated or minimized by decorating the lysine side chains. Peptide 
5 concentration also plays a role in affecting the formation of higher ordered structures. 

[01 1 OJ A s econd s ynthetic p eptide s equence (SEQ ID NO:2) w as p rq>ared b y standard 
solid-phase peptide synthesis, as discussed above. The sequence was designed to examine 
the different amino acids and their imaging capacity. An alpha helix structure was 
designed by loading one side with helix preferring amino acids and decorating the other 
1 0 side w ith t he s ide c hains o f o ther a mino a cids. T hus, a lanine a nd m ethionine r esidues 
were placed on one side of the helix, while representatives of the remaining amino acids 
were placed on the other side of the helix. 

GALYAMARAVHAMAEAACQAAWAMG (SEQ ED NO:2) 

1 5 Bi-functional Fidlerenes 

[0111] An exemplary bifimctional nano-barcode subunit, compatible with solid phase 
oligonucleotide synthesis, was designed around the structure of a modified C70 fullerene. 
In certain embodiments of the invention, primary alcohols are utilized for the two 
functional groups, thus creating a diol-fiillerene analog. Secondary and tertiary alcohols 

20 may also be used, although they are less reactive and more sterically hindered. The first 
part of the synthesis involves fomiation of the bi-functionalized fullerene molecule, where 
the fiinctional groups may be OH, CH2OH or COOH. In altemative embodiments, a di- 
carboxylated fullerene may be prepared and the carboxyl groups reacted with a reagent 
such as l-amino,3-propanol in the presence of a condensing reagent {e.g., DCC) to create 

25 two alcohols. The amine groups condense with the carboxyls, resulting in the attachment 
of a hydrocarbon chain terminating in a hydroxyl residue. This pathway can lead to 
several useful products differing in the length of the hydrocarbon chain and ultimately 
affecting the spacing of fullerenes upon controlled assembly of a fullerene chain. One 
caveat of using a carboxylated fullerene is the additional reactions involved, resulting in 

30 less product and lower yield. 
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[0112] Following derivitization, the bi-functionalized product(s) are purified. In order to 
effectively form coded probes from the bi-functionalized subunit, the two functional 
groups are located at opposite ends of the molecule. Impurities with one functionality or 
with two adjacent modifications are removed. A Ithough location of the two functional 
5 groups at less than 180*" separation may be feasible, the location of the two functional 
groups at less than 150"* is not acceptable for coded probe formation. 

[0113] The purified diol product is further modified into a building block for synthesis, as 
disclosed in FIG. 7. The tritylation reaction is straightforward, involving the reaction of 
the diol modified fuUerene with a dimethyl-trityl-chloride. The derivatization with DMT- 

10 CI produces a mixture of the desired mono-DMT and bis-DMT. The mono-tritylated 
product is purified and separated from the di-tritylated product. Phosphoramidation of ttie 
monotritylated product occurs by a standard reaction xmder inert reaction conditions. The 
chloro-2cyanoethyl-N,Ndiisopropyl-phosphoramidite reagent is conamercially available 
and i s b est u sed fresh and o nly o nee. T he r eaction p roceeds i mmediately, giving high 

15 yield. The products formed in this reaction will give increasing mobility on silica gel 
chromatography, allowing simple purification. The final product is often cleaned using a 
small pad of silica. The final product is dried under vacuxmi and stored dry under argon. 
The product may be incorporated into a polymer sequence using standard phosphoramidite 
chemistry. 

20 [0114] Taking advantage of the non-perfect spherical shape of C(70) presumably due to 
localized electron distributions focused on two opposite polar ends, a bi-functional C(70) 
subunit was designed. The C(70) fullerene was chosen to provide a scaSbId with two 
types of most reactive bonds located at the opposite sites. Five bi-substituted isomers 
were expected, including two pairs of enantiomers. A design incorporating two types of 

25 hydroxyl groups, which are present in nucleosides, allows the use of common protocols 
for oUgonucleotide synthesis. The primary alcohol is predominantly derivatized by DMT- 
CI, leaving a secondary alcohol available for phosphitylation. The alcohol groups are 
separated from the C(70) scaffold by a C2-C8 linker. 

[0115] An exemplary structure of a bi-functional fullerene, containing a primary and a 
30 secondary alcohol, is illustrated in FIG. 8. To form the primary alcohol, a monocarboxylic 
acid moiety can be introduced by Refoimatsky-type reaction using organozinc reagents or 
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by Prato addition using, for example, succinic acid semi-aldehyde leading to a substituted 
pyrrolidine derivative. Expected yields are 15-30%. The C(70) ester can be reduced with 
Dibal-H together with the carbonyl group. The preciu^or of the secondary alcohol moiety, 
a ketone, may be introduced using the Prato addition or Diels-Alder approach, with the 
5 commercially available trimethylsilyl enol of methyl vinyl ketone. The yields are 35% 
and 50%. 

Example 4. Production of Exemplary Bi-functional Fullerene 

[0116] In an exemplary embodiment of the invention, a C70 fuUerene-diol was produced 
that was converted to the DMTr (dimethoxytrityl) protected and phosphoramidite 
10 activated compoimd. This product may be used for sjmthesis of nano-barcodes. The 
phosphate backbone created by condensation of the phosphoramidite moiety may enhance 
water solubility of the fullerenes, facilitating later use of the resulting coded probes. 

[0117] A C70-diol intermediate of use for bifimctional fullerene synthesis was obtained 
from New England Pqjtide synthesis division (Fitchbiurg, MA). The intermediate product 
15 had limited solubility in appropriate solvents for blocking activation and polymerization. 
The product showed spontaneous reversion to the parent compound with a half-life 
estimated at —1.0 year. 

[0118] OUgonucleotide and peptide nucleic acid-based coded probes were also produced, 
using the schemes disclosed above, synthesized by Midland Certified Reagents (Midland, 
20 TX), AppHed Biosystems (Foster City, CA) or QIAGEN Operon (Alameda, CA). 

Example 5. Substrate Preparation and Molecule Attachment 

[0119] A variety of substrates may be used for imaging of coded probes. Imaging is slow 
(on the order of minutes) and molecules move rapidly (fractions of seconds). Thus, in 
order t o 1 imit t he m olecular m otions, s amples must b e a bsorbed o nto t he s ubstrate and 

25 become part of the crystal lattice. The imaging of DNA by AFM using mica exemplifies 
this concept. DNA binds mica through the phosphate backbone using a divalent metal 
such as Ni^^ or Mg^^. DNA and mica are both negatively charged, and it is necessary to 
use a counterion such as Mg^"^ or Ni^^ to adsorb DNA onto the mica (Biophys. J. 70:1933, 
1996; PNAS 94:496, 1997; Biochemistry 36:461, 1997). The divalent cations work as a 

30 coxmterion on the negatively charged DNA backbone and also give additional charges to 
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bind the mica. AP-mica (fimctionalized aminopropyl mica) has been used to bind DNA for 
AFM (Proc. Natl Acad. Set USA 94:496, 1997). 

Annealing Gold-on-Mica Substrates 

[0120] A quartz capillary torch was made by pulling a piece of 1 .00 mm o.d., 0.75 mm i.d. 
5 quartz capillary in a Sutter Instruments P-2000 capillary puller. The glass was scored and 
broken at a point where the capillary had an ID of about 200 \xm. The surface was then 
lapped flat and polished using 3M imperial lapping film. Quartz discs were heated on a 
heating block at 130°C for 5 minutes. Hie discs were flamed with a hydrogen torch using 
a 1.5 inch flame firom the quartz tip. Fresh gold substrate was placed (butter side up) on 

10 the center of the disc using tweezers. The substrate was held down using a pre-flamed 
1cm X 1cm X 1mm quartz block which only touched the mica surface and was left to heat 
for 5 minutes. The quartz capillary torch was held at 30° to the plane of the disc, such that 
the tip of the flame just touched the gold surface. The flame was passed repetitively over 
the gold surface (45 times) using a two inch pass in one second cycles. The substrate was 

1 5 stored under argon in its original container until use. 

DNA Deposition on Substrate 

[0121] DNA was deposited on mica and scanned by AFM. A population of different size 
plasmid molecules (differing by 1,000 bases in length) ranging from 1-1 0Kb was used and 
AFM images were obtained (not shown). 

20 Direct Attachment to Gold 

[0122] Molecules to be imaged may be attached to a substrate directly or indirectly. 
Direct attachment involve modifying the nano-tag with a functional group that 
^ecifically reacts with tiie substrate to create a covalent bond. Conditions for 
nucleophiUc attack of sulfur on reduced gold under aqueous conditions are known in the 

25 art. This approach has been optimized and spears feasible under mild conditions. Tlie 
redox kinetics for direct attachment may be controlled with pH. The reaction is specific 
and should not cross-react with other nano-tags. Another approach may use a more 
reactive attacking group, such as a radical based mechanisms or photo-catalyzed reactions. 
In general, radical reaction kinetics are fast and robust, but often lack control. One final 

30 approach uses a caged sulfur analog that is deprotected with light or pH. This approach 
would use a similar mechanism as the first approach but has an added element of 
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specificity for initiating and localizing the reaction. Exemplary reactive moieties for 
covalent attachment to gold surfaces include sulfhydryl groups, oxygen radicals, carbon 
radicals and photoactivated reagents, such as various sulfur compounds known in the art. 
Of these, attachment of sulfhydryl-modified oUgonucleotides to gold surfaces is the most 
5 extensively studied and has been disclosed in nimierous publications. In addition, thiol- 
modified oligonucleotide probes are commercially available from standard sources (e,g,. 
Midland Certified Reagents, Midland TX). 

Indirect Attachment to Gold 

[0123] Indirect attachment of targets to substrates involves a "linker" molecule to provide 
10 an attachment site to the substrate as well as to the nano-barcode. In this strategy a bi- 
fimctional linker molecule is used. The linker molecule has one fimctional group for 
attachment to gold and another for attachment to the barcode. One advantage to this 
approach is that a substrate can be modified with linker molecules at the desired density 
and verified by imaging prior to attaching the barcodes through a second reaction. 

15 [0124] In a non-limiting example, the linker molecule m ay b e attached to gold using a 
sulfhydryl group, wifli a different fimctional group at the opposite end of the linker. One 
non-limiting example would be a carboxyl group. Aminolated spacer barcodes can be 
specifically (but irreversibly) reacted with a terminal carboxyl. C arbodiimide-mediated 
condensation of amine with carboxyl groups is a well-understood chemical route. 

20 Example 7. STM Imaging 

Gold Nanoparticles 

[0125] AFM images were obtained with gold nanoparticles and lambda DNA. The 
substrates used were a poly L-lysine coated glass cover slip and amino-treated mica (AP- 
mica). AP-mica was obtained by vapor phase treatment of freshly cleaved mica with 3- 

25 aminopropyltriethoxy silane). Gold nanoparticles of 50 nm, 10 nm, 5 nm and 2 were 
purchased from Ted-pella Inc. (Redding, CA). With a poly L-lysine coverslip substrate, 
10 [il of gold colloidal solution was left to dry on the coverslip. With AP-mica, 100 \i\ of 
gold colloidal solution was placed on the substrate for 15 min. Excess solution was then 
wicked off with a Kimwipe. AFM imaging of the AP-mica substrate, using a Digital 

30 Instruments NanoScope® in tapping mode AFM, showed a smooth, featureless surface. 
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The AP-mica proved to be a good surface for immobilizing gold nanoparticles. The 50 
mn gold nanoparticles were easily imaged by AFM (not shown). The 5 and 10 nm gold 
nanoparticles were also clearly visible by AFM (not shown). The 2 nm gold nanoparticles 
were individually distinguishable^ although the image resolution was not as sharp as with 
5 larger nanoparticles (not shown). 

[0126] It was possible to distinguish between different sized nanoparticles hi a mixture of 
10, 5 and 2 nm gold nanoparticles (not shown). The 2 and 5 nm nanoparticles could be 
distinguished by the measured height using tapping mode AFM. These results show that 
nano-barcodes based upon different sized nanoparticles may be distinguished by SPM 
1 0 imaging techniques. 

[0127] In another non-limiting example, 20 |il of poly-L-lysine solution (0.01% from 
Sigma Chemicals, St. Louis, MO) was placed onto a mica substrate for about 5 minutes, 
then rinsed with nanopure water (18 MQ) and dried under filtered N2 gas. Gold 
nanoparticles (from Polysciences or Ted-Pella Inc.) were sonicated for 30 sec. A 25 \i\ 
15 sample of undiluted nanoparticles was placed onto the poly-L-lysine coated mica for about 
10 min» then rinsed with nanopure water and dried under filtered N2 gas. Images were 
obtained with a Digital Instruments NanoScope® in tapping mode AFM (not shown). 

[0128] A Hind IQ digest of lambda DNA was also imaged by AFM. A 1 p.g/ml solution 
of digested lambda DNA was prepared in HEPES buffer (40 mM HEPES, 5 mM NiQ, pH 
20 6.8). A 30 yd sample of DNA solution was deposited onto a treated mica substrate for 10 
min, rinsed with nanopure water and dried under N2 gas. The AFM images of digested 
lambda DNA are shown in FIG. 9. The double-stranded DNA molecules are clearly 
visible by AFM imaging. 

FuUerenes 

25 [0129] An image of a single fiillerene molecule deposited on a graphite surface was 
obtained by STM imaging, using a Digital Instruments NanoScope® with a 14.46 nm scan 
size (not shown). Multiple fiillerenes were connected by peptides and imaged. Fo\ir 
fiiUerenes were attached to the peptide of SEQ ID NO:l and an image was obtained by 
STM scanning, showing each of the four fiillerenes (not shown). 

30 Examples. Alignmentof Nucleic Acids 

37 



BNSDOCID: <WO__2004038037A2J_> 



wo 2004/038037 



PCTAJS2003/029726 



[01301 Lambda DNA was aligned by microfluidic molecular combing. A microfluidic 
channel was prepared in a layer of PDMS overlaying a substrate. Microfluidic channels 
were made by molding polydimethylsiloxane (PDMS) according to Anderson et al 
("Fabrication of topologically complex three-dimensional microfluidic systems in PDMS 
by rapid prototyping," Anal Chem. 72:3158-3164, 2000). The substrate may comprise, 
for example, AP-mica or a gold coated substrate prepared as discussed above. A sample 
may b e i ntroduced i nto a c hamber at o ne end o f a m icrofluidic c hannel and a v acumn 
applied to a reservoir at the other end of the channel. The addition of one or more posts 
v^thin the channel allows for molecule alignment by molecular combing. The PDMS 
layer is removed and the substrate rinsed with nanopm-e water and dried with N2 gas. 
Various alignments may be formed using multiple chambers and/or microfludic channels, 
different patterns of microfludic components, different microfluidic streams and different 
structures within the channels. 

[0131] FIG. 10 and FIG. 11 show examples of lambda DNA molecules, ahgned by the 
MMC process. The fully stretched and aligned lambda DNA was about 17 fun in length. 
Molecules were aligned parallel to the direction of microfluidic flow, as expected. This 
result demonstrates the feasibihty of ahgning coded probes on a surface, either hybridized 
to a target or else hybridized and then released. The alignment of the coded probe 
molecules facilitates their imagmg and identification by SPM imaging techniques. 

Example 9. AFM Imagmg of Oligonucleotide Based Coded Probe 

[0132] In another non-limiting example, coded probes may be produced as a set of short 
oligonucleotide sequences hybridized together, as illustrated in FIG. 12. Each line in the 
Figure represents a single synthetic oUgonucleotide, 9 on the top strand and 4 on the 
bottom strand. Hybridization creates branch points that may be imaged by SPM 
techniques. Alternatively, the branch points may serve as attachment sites for metal 
nanoparticles or other tag elements, as discussed above. An exemplary oligonucleotide 
coded probe sequence is provide in FIG. 13, showing the sequences of the top and bottom 
strands hybridized to each other. For clarity, the branch sequences are not shown in FIG. 
13. FIG 14 shows the complete sequences of the 9 separate oligonucleotides that form the 
top strand of the coded probe. The portions that hybridize to each other to fomi branch 
sites are indicated. For example, the 3' end of PTl (SEQ ID NO:3), labeled "A", 

38 



.2004038037A2_t_> 



wo 2004/038037 



PCTAJS2003/029726 



hybridizes to the 5' end of PT2 (SEQ ID NO:4), labeled "A'". Similarly, B binds to B', C 
binds to C\ etc. 

[0133] The exemplary coded probe was imaged by AFM techniques as discussed above. 
An AFM image of the coded probe is indicated by the arrow in FIG. 15. For comparison, 
5 a linearized 2.8 kb plasmid double-stranded DNA molecule is shown adjacent to the coded 
probe. 

* * * 

[0134] All of the METHODS, COMPOSITIONS and APPARATUS disclosed and 
claimed herein can be made and used without undue experimentation in hght of the 

10 present disclosure. It will be apparent to those of skill in the art that variations may be 
apphed to the METHODS, COMPOSITIONS and APPARATUS described herein without 
departing from tiie concept, spirit and scope of the claimed subject matter. More 
specifically, it will be apparent that certain agents that are related may be substituted for 
the agmts described herein while the same or similar results would be achieved. All such 

15 similar substitutes and modifications apparent to those skilled in the art are deemed to be 
within the spirit, scope and concept of the claimed subject matter. 
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Table 1. Exemplary Nanotag Elements 



Molecules 


Vender 


MW 
g/mol 


Distinguishable 
Features 


Fullerienes .; 








C60 


BuckyUSA 


720.6 


size, shape, low density 


C70 


BuckyUSA 


840.7 


size, shape, low density 


C84 


BuckyUSA 


1008.9 


size, shape, low density 


Metal Center Fullerenes 








• C60La 


BuckyUSA 


859.5 


size, shape, high 
electron density & 
charge 


C84La 


BuckyUSA 


1147.8 


size, shape, high 
electron density & 
charge 


C60Er 


BuckyUSA 


887.5 


size, shape, high 
electron density & 
charge 


C84Er 


BuckyUSA 


1176.8 


size, shape, high 
electron density & 
charge 


Fullerede Oxides ^ 






' ' ' •'' ■ d- • 


C60-O 


BuckyUSA 


736 


size, shape, low density 


C70-O 


BuckyUSA 


856 


size, shape, low density 


Bifiinctional Fiillerenes 








O-C60-O 


BuckyUSA 


752 


size, shape, low density 


O-C70-O 


BuckyUSA 


872 


size, shape, low density 


P.O.S.S.- Polyheclral oligomeric 
. -r silsesquioxane;:;.-; 


. Hybrid Pliastics 


.800-1600 


' - 800-1600 : " 


Octakis pentacyclo 
octasiloxane hydrate 


Aldrich 


1137 


Size, shape, charge (-) 


OctaAmmonium POSS 


Hybrid Plastics 




Size, shape, charge (+) 


Octalsobutyl POSS 


Hybrid Plastics 




size, shape 


OctaMethyl POSS 


Hybrid Plastics 




size, shape 


OctaTmAPOSS 


Hybrid Plastics 




size, shape, density 
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Table 1 (continued) 



Organomefallics 








Metal Centers include:Cr, Fe, 
Al, B, Co, Ni, Zr, Cu, Mg, Zn 
andRu. Organic moieties 
include any functionalizable 
structure including, 
sepulcrates, bipyridines, 
porphrines, corrins, EDTA, 
biphenyl, benzene, 
phthalocyanine, 
hematoporphyrin, heme, 
naphthalocyanine, 
phthalocyanine, 
Cyclopentadiene, Indene, 
Fluorene, Benzorndene, 4- 
Fluorophenyl, 4- 
Methoxypheny, Tris(4- 
chlorophenyl) and others 


Aldrich, Acros, 
Boulder 
Scientific 




Metal centers have 
different size of outer 
orbital, density, charge 
distribution, and redox 
states. The organic 
moieties impart size, 
shape and density 
characteristics. 


Cu n trifluoroacetyl acetate 


Aldrich 






Cu n phthalocyanine 


Aldrich 






Co n phthalocyanine 


Aldrich 






Fe n phthalocyanine 


Aldnch 






Zn n phthalocyanine 


Aldrich 






Ni n phthalocyanine 


Alancn 






Mg n phthalocyanine 


Aldrich 






Co n 2-3 naphthalocyanine 


Aldrich 






l,r-Ferrocenedicarboxylic acid 


Aldrich 


274.06 




Co HI sepulcrate trichloride 


Aldrich 






Cu n 2-pyrazinecarboxylate 


Aldrich 






Nano-crystal particle (Ag), 
NHS esters 


Nanoprobes 
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TABLE 2. Potential Subunits for Backbone Mediated Synthesis 



Caiiididate 




^SkJiUtwIllIlCIll. kVUtJPIlIUl .'.j 


"C66 


C60COOH 






:C7OC0OH 


Lysine , « . :. ' • 


ILaBuckey ;!; 


\ iLABuckyCOOH 


- ■ =■ ' • . ^; .:> = 
.•.:Lysine" \ • ' '•' ..liij:; i 




; ;C60cobH 


Ethyl ammo Thymidine Hi ^ 


;C7a. ■■ ^ 


ri: |C70COOH 


:,Ethyl,amino Thjomd!^ C: . 


ILaBuckey 


* LA Bucky COOH ■ : 


iEthyiirfiamno 


(Nm)8 p6ss 


■.|na' 


• Glutajnte oiv^g^ vli- 


[Metal Phalocymonine^ 


jCOOH 


L^ine or NH2-Thymicline IIP' ^ 



DVletal^Phalocyanonme ; j j; :,^JnH2 ' _ Glutamic or aspartic acid^S;: : 
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Table 3. Exemplary Subunits for Polymer Decoration 



Tag Element Mono- Attachment to polymer . Polymer sequence 

functionali ; subonit 

zed 



|C60 

C 7 0 . , 
La Buckey 
LaBu0key ; 

(M 

C70 
C70 

- V' 

JLa iBuckey, 



C60COOH Lysine 
C60COOH Lysine 



NH2-(Gly-Gly-GIy-Lys)8-COOH 
■< NH2-(A-A-A-A-A-A-K)7-COOH 



C7PCOOH Lysinei 
C70COOH Lysine- 



lNH2-(Gly-Gly-GIy-Lys)8-COOH 
NH2-(A-ArA-A-A-A-IQ7-:COOH 



LA Bucky Lysine 

COOH ? 

LA Bucky Lysine 
COOH 



NH2-(Gly-Gly-GIy-Lys)8-COOH 
;;NH2-(A-A-A-A-A-A-K)7-COPH 



C60COOH Ethyl amino Thynudine (X5 5'-(T-X)10-3' 

:C60COOH Ethyl anuno T^ymiSie ^ ;5*-(5p<^) vvdiere Q is 12 atom spacer 



|C70COOH EthylaminolTijOTidine (30 5'-^^ 

!G70COOH B&yl aimno ThpjMine'^^^^ Q isj2 atom spacer 



.LaBnckey - 

(pi2)8PpSS |na 

(NH2)8POSSMilA 

(NH2)8 POSS |NA ■ 

Mdtal ' COOH 
Phalocyanine . 
Metal - 'COOH 
Ph^bcyabine ■ 



LA Bucky iEthyl amino Thymidine (X) 5'-(T-X)10-3' 

COOH , ' ' ■ ■< 

LA Bucky E]thyi amino Thymidine PQ 'I 5'-(X-Q) where Q is 12 atom spacer 
.COOH- .>.;■: 



Glutamic or aspartic 'acid - :NH2-(Gly-Gly-Gly-GIu)8-COOH 
Gljitamic or>spaaSc acid "'NHikATA-A-A^^^ 



T carbbxylate analog (Y), 
Lysine . 



'5'-(T^10-3^ -f-. 
lSIH2-(A-A-A-A-A-A-K)7-COOH 



Lysine 



i^H2■<Gly7Gly-Gly-Lys)^t-COpH. 
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Table 4. Exemplary Subunits for Direct Polymer Imaging 



Subunit 

Lysine (K) 
Glutamic acid (E)' 
EandK " 
Br-T(Br). 
.. NH'2-T(N) 
Brand.N 
Phosphate and spiacers' 



■i-ii 



Polymer 



(A6-K)8_or (AAKAAAK)4 or KKKKKKK 
(A6-E)8 or (AAEAAAE)4 ot EEEEEE 



(AAKAAAE)4 

T-Br-T-Br-TTT-Br-TTT-Br-Br-f 



IT-N-T-N-TTT-N-TTT-N-N-T • 
:>,T-Br-T-N-T-Br-BrTTTT-N^N-Br-T 



TTT-3-9-3-3-9-9r3-9 
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CLAIMS 

What is claimed is: 

1 . A method comprising: 

a) obtaining one or more coded probes, each coded probe comprising a probe 
5 molecule attached to at least one nano-barcode; 

b) contacting one or more target molecules with the coded probes; 

c) organizing the coded probes that bind to the one or more target molecules; 

d) identifying the organized coded probes; and 

e) detecting the one or more target molecules based on the boxmd coded probes. 
10 2. The method of claim 1, wherein each coded probe comprises an oligonucleotide. 

3. The method of claim 2, wherein the target molecule is a nucleic acid. 

4. The method of claim 3, wherein a hbrary of coded probes comprising all possible 
sequences for a particular length of oligonucleotide is contacted with the target 
molecule. 

15 5. The method of claim 1, wherein the nano-barcode is selected from the group consisting 
of carbon nanotubes, fullorenes, submicrometer metallic barcodes, nanoparticles and 
quantum dots. 

6. The method of claim 3, wherein the nucleic acid is attached to a surface. 

7. The method of claim 6, further comprising ligating adjacent coded probes that are 
20 hybridized to the nucleic acid. 

8. The method of claim 7, further comprising separating ligated coded probes from the 
nucleic acid and non-ligated coded probes. 

9. The method of claim 1, further comprising aligning the coded probes on a surface by 
molecular combing. 

25 10. The m ethod o f c laim 1 , wherein the c oded p robes are i dentified b y s canning p robe 
microscopy. 
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11. The method of claim 10, wherein the scamiing probe microscopy technique is selected 
from the group consisting of atomic force microscopy, scanning tunneling microscopy, 
lateral force necroscopy, chemical force microscopy, force modulation imaging, 
magnetic force microscopy, high frequency magnetic force microscopy, 
magnetoresistive sensitivity mapping, electric force microscopy, scanning capacitance 
microscopy, scanning spreading resistance microscopy, tunneling atomic force 
microscopy and conductive atomic force microscopy. 

12. The method of claim 9, wherein the coded probes aligned on the surface are identified 
by scanning probe microscopy. 

13. The method of claim 12, ftirther comprising detemiining the sequences of 
oligonucleotides that bind to the nucleic acid. 

14. The method of claim 13, further comprising determining the sequence of the nucleic 
acid from the sequences of oligonucleotides that bind to the nucleic acid. 

15. The method of claun 3, fiulher comprising identifying the nucleic acid from the coded 
probes that bind to the nucleic acid. 

16. The method of claim 1, wherein the target molecule is a protein, a peptide, a 
glycoprotein, a lipoprotein, a nucleic acid, a polynucleotide, an oligonucleotide, a 
lipid, a glycolipid or a polysaccharide. 

17. The method of claim 16, wherein two or more target molecules are present in a sample 
and all target molecules in the sample are analyzed at the same time. 

18. The method of claim 16, wherein two or more target molecules are present in a sample 
and all target molecules of the same type are analyzed at the same time. 

19. A method comprising: 

a) obtaining one or more coded probes, each coded probe comprising a probe 
molecule attached to at least one nano-barcode; 

b) contacting one or more target molecules with the coded probes; 

c) aligning on a surface the coded probes that bind to the one or more target 
molecules; 

d) xising scanning probe microscopy to identify the aligned coded probes; and 
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e) detecting the one or more target molecules from the identified coded probes. 

20. The method of claim 19, wherein the coded probes are aUgned on a surface by 
molecular combing. 

21. The method of claim 19, wherein the scanning probe microscopy technique is selected 
5 from the group consisting of atomic force microscopy, scanning tunneling microscopy, 

lateral force microscopy, chemical force microscopy, force modulation imaging, 
magnetic force microscopy, high frequency magnetic force microscopy, 
magnetoresistive sensitivity mapping, electric force microscopy, scanning c^acitance 
microscopy, scanning spreading resistance microscopy, tunneling atonodc force 
10 microscopy and conductive atomic force microscopy. 

22. The method of claim 19, wherein the target molecule is a nucleic acid. 

23. The method of claim 22, further comprising determining at least part of the sequence 
of the nucleic acid from the boimd coded probes. 

24. The method of claim 19, fiirther comprising separating the boimd coded probes from 
15 the target molecules before the coded probes are aligned on a surface. 

25. A system for nucleic acid sequencing comprising: 

a) a scanning probe microscope; 

b) a surface; and 

c) at least one coded probe attached to the surface. 

20 26. The system of claim 25, wherem the coded probes are aligned on the surface by 
molecular combing. 

27. The system of claim 25, wherein the coded probes comprise ligated oligonucleotides. 

28. The system of claim 25, wherein the scanning probe microscope is an atomic force 
microscope or a scaiming tunneling microscope. 

25 
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FIG. 5 




BNSDOCID: <WQ__2004038037A2_L> 



wo 2004/038037 



6/15 



PCT/US2003/029726 



Fia6 




BNSDCXID: <WO__2004038037A2_I_> 



wo 2004/038037 PCTAJS2003/029726 

7/15 




BNSDOCID: <WOl__20M038037A2_L> 



wo 2004/038037 



8/15 



PCT/US2003/029726 



FIG. 8 
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FIG. 9 




Data type 
Z range 



Height 
1.500 nm 



2.00 [ixn 
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